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STELLAR MASS-TO-LIGHT RATIOS FROM GALAXY SPECTRA: HOW ACCURATE CAN THEY BE? 
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ABSTRACT 

Stellar masses play a crucial role in the exploration of galaxy properties and the evolution of the galaxy pop- 
ulation. In this paper, we explore the minimum possible uncertainties in stellar mass-to-light (M*/L) ratios from 
the assumed star formation history (SFH) and metallicity distribution, with the goals of providing a minimum 
set of requirements for observational studies. We use a large Monte Carlo library of SFHs to study as a function 
of galaxy spectral type and signal-to-noise ratio (S/N) the statistical uncertainties of M*/L values using either 
absorption-line data or broad band colors. The accuracy of M*/L estimates can be significantly improved by using 
metal-sensitive indices in combination with age-sensitive indices, in particular for galaxies with intermediate-age 
or young stellar populations. While M*/L accuracy clearly depends on the spectral S/N ratio, there is no significant 
gain in improving the S/N much above 50/pix and limiting uncertainties of ~ 0.03 dex are reached. Assuming that 
dust is accurately corrected or absent and that the redshift is known, color-based M»/L estimates are only slightly 
more uncertain than spectroscopic estimates (at comparable spectroscopic and photometric quality), but are more 
easily affected by systematic biases. This is the case in particular for galaxies with bursty SFHs (high H6a at fixed 
D4000 n ), the M*/L of which cannot be constrained any better than ~ 0.15 dex with any indicators explored here. 
Finally, we explore the effects of the assumed prior distribution in SFHs and metallicity, finding them to be higher 
for color-based estimates. 

Subject headings: galaxies: fundamental parameters, galaxies: stellar content, methods: data analysis 



1. INTRODUCTION 

The study of the galaxy population as a function of their stel- 
lar masses has become increasingly common and important in 
the last years. For a long time, galaxy scaling relations were 
studied in terms of galaxy luminosities, often in the optical 
regime where differences in star formation histories between 
different galaxies would lead to factors of 3-10 difference in 
luminosity for a given stellar mass. Comparison of galaxies 
with different star formation histories and similar luminosities 
is therefore not a well-posed exercise, because it translates into 
comparing galaxies of dramatically different stellar mass. 

This situation has significantly improved in the last decade 
thanks to the development and continuous refinement of meth- 
ods to estimate the stellar mass-to-light ratio (M*/L) from 
galaxies spectro-photometric properties. All the various tech- 
niques involve comparison with predictions from population 
synthesis models. The observational constraints used to derive 
galaxy M*/L estim ates range from relatively 'cheap' observa- 
tions of colors (e.g. iBrinchmann & Ellis 2000; Papo vich et al.1 
1200 It iBell & de Jondl2001t ICole et alJl2001t iBell et al.ll2003l) . 
to broad-band spectral energy distributions (SEDs) exte nding 
from the optical or UV up to the IR (e.g . iBorch et all [2006 ; 
IWalcher et al.1 l2008t iFranzetti et al.1 120081) . to spectroscopic 
measurements of individual stellar absorp tion features (e.g. 
iKauffmann et al.ll2003at iGallazzi et alj|2005l) or the full opti- 
cal galaxy spectrum (e.g.lPanter et al.ll2004tlToieiro et al.l2007t 
ICid Fernandes et alj|2007l) . 

The technical development has proceeded in parallel with 
(and has been largely motivated and made possible by) the rise 
of large homogenous spectroscopic and photometri c datasets 
from surveys such as the Sloan Digital Sky Survey ([Y ork et al. 
120001) , the Two Micron All-Sky Survey (ISkrutskie et al .1120061) . 
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COMBO- 17 survey dWolf et al.l l200l. VIMO S VLT Deep 
Surve y dLe Fevre et all 120051) . and COSMOS dScoville et al.l 
2007). The application of stellar mass estimation techniques 
to these datasets has allowed the community to make rapid 
progress towards understand ing the properties and evolution of 
the galaxy population (e.g.,[K auffman n et al.l200 3b: Shen et all 
2003; Bal drv et alJl2006tlBeU et alJl2003tlFontana et al.ll2006l) 7 
While it is generally acknowledged that ranking galaxies by 
their estimated stellar masses rather than observed luminosi- 
ties provides a more physical insight, it is also recognized that 
in translating observational quantities into physical parameters 
estimates, we are limited by a number of statistical and system- 
atic uncertainties. To start with, the accuracy of physical pa- 
rameters estimates clearly relies on the accuracy of the popula- 
tion synthesis models used to interpret observations (i.e. galaxy 
colors or spectra). In particular differences in the stellar li- 
braries (either empirical or theoretical) and uncertain (not-well 
observationally constrained and/or not- well theoretically under- 
stood) aspects of particular stellar evolutionary phases can af- 
fect the predicted SED for a given star formation history (SFH). 
Among these, it is recognized that thermally-pulsating asymp- 
totic giant branch stars (TP-AGB) can have a significant impact 
on M*/L values estimated from near-infrared (NIR) colors of 
young stellar populations (~ 0.5 -2Gyr). For inst ance, M*/L 
of high-redshift galaxies based on Maraston (2005) models can 
be low er by 40-60% than those based on iBruzual & Chariot! 
(2003) code because of the different flux contribution by stars in 
this particular phas e (e.g. jMaraston et aT]|2006t |Bruzualir2007l: 
ICimatti et al. 2008). Blue stragglers and blue horizontal branch 
morphologies can instead affect (in an age-dependent fashion) 
the SEDs, and derived properties, of passive galaxies. More- 
over, the treatment of non-solar abundance ratios in popula- 
tion synthesis models can affect the stellar ages (and hence 
M^/ L) of early-type gal axies derived from spectral features 
(e.g.. lThomas et aLll2004l) . These and other stell ar populations 
uncertainties have been extensively explored in IConrov et al.1 
d2008ll2009h . 
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Dust attenuation is a key uncertainty in stellar M/L val- 
ues, especially tho se derived usi ng col or informatio n (e.g ., 
IPozzetti et all 120071 iFontana etal]|2004 IZibetti etalj|2009l - 
althou gh dust can affec t absorption features like the 4000A- 



break, lMacArthurll2005l) . It has been argued tha t dust should to 
first o rder cancel out in the total estimated dBell & de Jong 
2001). Yet, there are a number of caveats and possible excep- 
tions to this simplistic picture. In particular, for systems with 
high optical depth (either globally or in patches), there can be 
consider able absorption of light without l a rge amounts ofred- 
dening dde Jonglll996l iDriver et all [20071: IZibetti et al.ll2009l) . 
If the attenuation is patchy, moving towards resolved mass 
maps, and then integrating t he result to a total mass, reduces 
this systematic uncertainty (Abra ham et al.|[l99^Zibetti et alj 
2009 jfl Modeling of the effects of dust can help dde Jond 19961: 
Driv er et alj 120 07), where total thermal infr ared flux can be 
of some use in constrainin g the model (e.g., Popescu & Tuffs 
2005: Ida Cunha et al.ll2008l) . 

The predicted photometric properties and M*/L of galax- 
ies also strongly depend on the assumed initial mass function 
(IMF). While it is sometimes possible to convert M*/L val- 
ues determined assuming a given IMF into another one with a 
simple scaling factor when the differences are only at masses 
< IMq, IMFs that are different at masses > 1M Q can alter the 
galaxy SED in an age-dependent way and hence have a dif- 
ferent gl obal effect on the c olors of star-forming and passive 
galaxies dConrov et al.l 120091) . Generally, studies of the aver- 
age slope of the high-mass (> 1 M ) part o f the stellar IMF 
determine slopes cons i stent with Salpeter (e.g. Kennicutt 1983; 
iBaldrv & Glazebrookl 120031: iHoversten & Glazebrookl 120081) . 
There are a number of studies that rely on chemical abun- 
dances that sometimes favor slightly top-heav i er IMFs (e.g. 



Worthev et alj|1992t iGibson & Matteu cci)ll997t iThomas et all 

If, 



1999t " iTrager et al.l l2000h lArrigoni et al.l 120091): yet, note that 



chemical modeling of galaxy evolution is uncertain (owing to 
uncertainties in e.g., yields, gas infall and outflow) and such 
results should be viewed as being interesting but necessarily 
tentative in nature. 

Moreover it is generally assumed that the shape of the IMF 
is universal and constant in time. Dynamical and strong lens- 
ing estimates of total M/L on galactic scales (i.e., < half- 
light radius scales) have historica ll y prov en compatib l e with 
a universally-applicable Chabrie ri d2003l) or iRroupal d200ll) 
stellar IMF (e.g.. iBell & de Jongjl200U ICappellari et aLl l2006: 
iGaUazziet at] 120061: iFerreraset al J 120081 R. S. de Jong & E. 
F. Bell, in preparation); such an IMF is motivated by local 
observations of the luminosity/mass function of star clus- 
ters, and is consistent with the statistics of Ha equivalent 
width as a function of galaxy color for luminous galaxies 
dHoversten & G lazebrook 2008). There have been a few recent 
arguments that, if borne out by further analysis and observa- 
tions, may sugges t a time-dependent s tellar IMF dWilkins et al.l 
20081 lDave1l2008l: Ivan DokkumH2008l) . Furthermore, statistics 
of Ha equivalent width as a function of galaxy color/UV flux 
show a t endency towards fewer very m a ssive stars in dwarf 
galaxies dHoversten & Glazebrookl 120081: iMeurer et alj 12009) 
that may indicate a change in the galaxy-averaged stellar IMF 
(as opposed to the IMF o f individual clusters) in l ow-luminosity 
galaxies (see also, e.g. JWeidner & Kroupall2005l) . It has to be 

3 This is an advantage of color-based stellar M/L estimates over spectroscopic 
values: the spectra to be analyzed typically are only of a part of a galaxy, 
whereas imaging exists across the face of the galaxy and one can account for 
optically-thick parts separately, if one wishes. 



noted however that Elmegree^ d2006l) found no evidence of 
the composite IMF being steeper that individual cluster IMF, 
supporting the equality of the IMF from small to large scales. 

Finally, we note that the derived M*/L estimates depend on 
the assumed distribution in SFHs of the models used to interpret 
galaxy SEDs. In particular it has been shown that the addition 
of bursts of star formation on top of a continuous SFH can pro- 
duce M* /L estimates systematically different by an amount that 
can vary between ~10% and a factor of two depending on the 
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strength and fract i on of starburst s (e.g., IBell & de Jond 
Drorv et all 12004 IPozzetti et ail 120071: iGallazzi et al.1 
Wuvts et aU2009l) . In the last few years several works have put 
significant efforts in quantifying the overall statistical and sys- 
tematic uncertainties on M*/L estimates and quantities based 
on them, such as stellar mass functions and cosmic stellar mass 
densi t ies (to mention a few ICimatti et al.ll2008t IGallazzi et al.l 
2008; Ma rchesini et al.ll2008t lLonghetti & Saraccoll2009l) . The 
general impression from all such works is that it is difficult to 
defend M„/L estimates to much better than 0.1 dex under the 
most ideal conditions, and perhaps even worse if more extreme 
conditions are allowed as discussed above (e.g. if there are 
systematic and/or random stellar IMF variations from galaxy to 
galaxy, if there are frequent prominent bursts). 

In this paper, we take a somewhat orthogonal approach to the 
question of M*/L accuracies. In particular we neglect here un- 
certainties related to the physics of population synthesis models 
and assume that we have a 'perfect' model and that the IMF is 
indeed universally-applicable. We further assume that dust can 
be ignored, either because dealing with dust-free systems or 
because it can be 'perfectly' corrected for. The motivation for 
making such ideal assumptions is to isolate the contribution of 
SFH and metallicity scatter to the M*/L error budget. By doing 
so we wish to explore in depth the following issues: i) how does 
M* /L accuracy depend on the choice of stellar population diag- 
nostics used to estimate its value; ii) how does M*/L accuracy 
respond to increasing spectroscopic or photometric quality; iii) 
how do the answers to these questions depend on the galaxy 
SFH (or spectral type). 

While at first sight our approach may seem somewhat no- 
tional, given the aforementioned uncertainties in stellar popu- 
lation models and concerns about the possibility of stellar IMF 
variations, yet in isolating the influence of SFH and metallicity 
variations on M*/L values one can gain insight into a number 
of issues. 

• How much effort should be placed in creating realis- 
tic and physically-motivated grids of SFH and metallic- 
ity for use in stellar mass estimation? If variations in 
SFH and metallicity produce little scatter in M*/L val- 
ues, then little effort needs to be expended in producing 
realistic templates. If SFH/metallicity variations lead to 
large scatter in stellar M/L, even for excellent data, then 
the creation of realistic priors for the SFH and metallic- 
ity distributions acquires some urgency. 

• When should we stop trying to improve the models? For 
example, if a particular source of uncertainty in a stellar 
population model gives an improvement that is much 
smaller than the minimum possible uncertainty from 
plausible variations in M»/L from SFH/metallicities, 
then one could argue that that particular improvement in 
stellar population model is less urgent than other sources 
of uncertainty. 
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• How good does my data need to be? An investigation of 
the scatter in M*/L from SFH/metallicity variations at a 
given set of colors or line indices, for a given data qual- 
ity, gives a lower limit to the M*/L uncertainties achiev- 
able with such data. Given a target M*/L uncertainty 
and method, one then learns what the target data quality 
should be (for effects related to SFH/metallicities only). 
This is less of a driver of this work, as higher quality or 
complementary data may be necessary to address other 
sources of uncertainty (such as stellar population syn- 
thesis model uncertainties or IMF variations); nonethe- 
less, this work helps to build intuition as to what kind 
of data quality might be required to produce accurate 
M*/L estimates, assuming that the other sources of un- 
certainty are dealt with adequately. 

In order to address these issues we study in some detail a 
large Monte Carlo library of SFHs described in Section |2~Tl that 
cover the spectroscopic and photometric properties of present- 
day galaxies. We randomly select a subset of models that rep- 
resent different galaxy spectral types having different SFHs, 
and we create mock galaxy samples by perturbing their spec- 
tra according to a given signal-to-noise ratio (S/N). We follow a 
Bayesian approach to derive the likelihood distribution of M*/L 
for these mock galaxies adopting as observational constraints 
several sets of absorption indices as outlined in Section 12.21 
In Section [3] we discuss the dependence of spectroscopically- 
derived M*/L estimates on the absorption indices used, on the 
spectroscopic S/N, on the SFH scatter, both in general and as 
a function of galaxy spectral type. We then compare these re- 
sults to those concerning M»/L estimates based on optical or 
optical-NIR colors in Section|4] In Section[5]we explore the ef- 
fects on M*/L estimates of any mismatch between the assumed 
SFH and metallicity prior and the true distribution. We finally 
present our conclusions in Section[6] Throughout this work we 
adopt a lChabrier! d2003l) IMF with mass cut-offs of 0. 1 M and 
100 M Q and we refer to the z-band stellar mass-to-light ratio. 

2. THE METHOD 
2.1. The library of star formation histories 

The library of star formation histories (SFH s ) that we exploit 
in this work is the one used in iGallazzi et al. I (120051) to derive 
stellar population parameters of SDSS galaxies. The library 
consists of 150000 Monte Carlo SFHs modeled by an exponen- 
tially declining SFR (with varying timescale and time of onset 
of star formation) to which random bursts of star formation can 
be superposed (with varying time of onset, duration and frac- 
tion of mass produced during the burst). The probability of 
having a burst is set such that 10% of the models in the library 
experience a burst in the last 2 Gyr. The metallicity varies be- 
tween 0.02 x Zq and 2.5 x Z Q but is kept fixed along each SFH 
(i.e. no chemical evolution is implemented). The galaxies in the 
library are also characterised by different stellar velocity disper- 
sions for comparison with observed absorption features whose 
strength depends on the velocity dispersion broadening. A uni- 
form prior distribution is assumed for all the parameters defin- 
ing the model galaxies. A discussion on the potential effects 
of the prior ass umptions (in particular the fraction of bursts) 
can be found in Gallazzi et alJ d2005ll2008h and will be further 
addressed in Section|5]concerning M*/L estimates. 

For each SFH we measure the resulting spectral absorption 
features, broad-band colors, luminosity-weighted ages and stel- 
lar mass-to-light ratios (accounting for the returned mass frac- 



tion) by convolving lBruzual & Charlotl d2003l hereafter BC03) 
simple stellar populations (SSP) with each SFH. The models 
cover the wavelength range from 91 A to 160/im, with 3 A res- 
olution over the range 3200-9500A. 

This library purely describes the stellar continuum spectrum 
for each SFH, while no account is taken of dust absorption. 
Stellar absorption features, used to estimate stellar mass-to- 
light ratios as well as luminosity-weighted ages and metallic - 
ities, are weakly affected by dust (but see iMacArthurl 12005. 
for a detailed analysis on the dependence of absorption index 
strengths on dust). We also do not include any treatment of 
emission lines from ionized gas in our model library. In ob- 
served spectra of star-forming galaxies emission from ionised 
gas can affect in particular the measurement of the Balmer 
absorption features. It is crucial that emission lines are care- 
fully corrected for in order to measure the true stellar absorp- 
tion and hence derive an unbiased measure of the stellar age. 
Medium/high spectral resolution and good signal-to-noise ratio 
(S/N) are particularly crucial in this respect. 

The distribution of the models in the D4000 n -H(5A plane is 
shown in Fig.[T](left panel). This plane is an observational diag- 
nosti c of recent star formatio n history (as already discussed in, 
e.g., Kauffmann et al. 2003a). Models without bursts or which 
had a burst more than ~ 6 Gyr ago ('continuous' models, i.e. 
dominated by an exponentially declining SFH) form a continu- 
ous sequence of decreasing HSa at increasing D4000 n , reflect- 
ing an increase in the luminosity-weighted age. The region of 
relatively high H^ values at intermediate D4000 n is occupied 
by models with more than one burst during their SFH ('bursty' 
models) and, in particular, which experienced a burst in the last 
2 Gyrs. Higher burst strengths produce higher HSa values at 
fixed D4000 n , resulting in a trend of decreasing luminosity- 
weighted age at increasing HSa at fixed D4000 n in the range 
1.1 < D4000 n < 1.8. Finally, models that reach very low 
D4000 n and low HSa values had a very recent (< 100 Myr) and 
intense (> 30% in mass with respect to the und erlying contin- 
uous SFH) burst of star formation. As shown in IGallazzi et al.l 
(120051) . the stellar mass, luminosity-weighted age and stellar 
metallicity of observed z = galaxies all increase with increas- 
ing D4000 n . At intermediate D4000 n values, where the relation 
with stellar mass is weaker, there is a broader range in ages, 
which decrease at increasing HSa, and in metallicities, which 
increase with increasing HSa- 

In this work we wish to explore our ability of constraining 
M*/L also in dependence of the galaxy SFH (or the galaxy 
spectral type). For this purpose the models that we choose to 
analyse as real observed galaxies are randomly drawn from five 
different locations on the D4000 n -H<5 J 4 plane (as indicated by 
the boxes in the left panel Fig. [TJ: continuous models dom- 
inated by either old, intermediate-age or young stellar popu- 
lations (red, orange and blue, respectively), and bursty mod- 
els with either intermediate or young ages (green and cyan, 
respectively). The z-band mass-to-light ratios (log(M*/L z )), 
luminosity-weighted ages (log(t r /yr)) and stellar metallicities 
(log(Z*/Z Q )) of the different classes of models are shown in 
the right panel of Fig. Q] There is a clear sequence of decreasing 
both log(M*/L z ) and log(t,/yr) with decreasing D4000 n and 
increasing HSa (i-e. from 'continuous old' to 'bursty young' 
types). Besides a continuous shift in the average mass-to- 
light ratio and stellar age, the distributions in these parame- 
ters tend to become slightly broader going from the homoge- 
neous 'continuous old' populations to more complex SFHs (and 
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more composite stellar populations). This is even more the 
case when stellar metallicity is considered. While the metal- 
licity of galaxies with strong D4000 n is typically solar and al- 
ways greater than 40% solar (because the 4000A-break is some- 
what metallicity-sensitive, and high values of D4000 n can only 
be reached in systems with high metallicity), the other galaxy 
types have a much broader metallicity distribution, even cover- 
ing the whole range of the models for the youngest populations. 
At intermediate ages (corresponding to intermediate D4000 n 
values) recent bursts of star formation produce higher metal- 
licities than continuous models (compare green and orange his- 
tograms), reflecting the anticorrelation between metallicity and 
(and age) in this D4000 n range as mentioned above. 

2.2. Stellar mass-to-light ratio estimates from spectral 
features 

As mentioned before, we create mock galaxies from a total 
of 553 models randomly chosen in order to represent different 
spectral types and SFHs. Mock galaxies are created by assign- 
ing an error to the absorption features typical of observed spec- 
tra with a given signal-to-noise ratio (S/N) and by perturbing 
the true index strengths according to this error. This proce- 
dure is repeated 100 times for each S/N value. Each time we 
calculate the probability density function (PDF) of log(M*/L z ) 
by comparing the 'observed' absorption indices with those pre- 
dicted by all the other models in the library with the same ve- 
locity dispersion (within 15 km s" 1 ). We then take the average 
of these 100 Monte Carlo realizations of the PDF as the final 
log(M*/L z ) likelihood distribution for a given galaxy at a given 
S/N. 

We consider nine values of S/N (namely 2, 5, 10, 20, 30, 40, 
50, 100 and 200). The typical error of each absorption feature 
at a given S/N is taken to be the average error of SDSS DR4 
galaxies in the redshift range 0.05 < z < 0.22 that have a spec- 
trum with the corresponding S/N per pixel. No SDSS spectra 
with S/N of 100 and 200 are available. We then fit a linear 
relationship between the error and the S/N (in log -log space) 
and extrapolate to S/N values of 100 and 200. We note that the 
average errors on the absorption features (and their extrapola- 
tion to the highest S/N values) are almost independent of the 
galaxy velocity dispersion, especially at S/N> 10. The index 
errors estimated in this way are consistent with those measured 
directly from template spectra perturbed with Gaussian noise (a 
reasonable assumption at S/N > 20). 

The PDF provides information on how plausible it is that 
the parameter of interest lies within a certain range of val- 
ues, given the constraints from the data (the absorption in- 
dices) on the assumed prior distribution of models. The PDF 
can be characterized by its median, which we take as our esti- 
mate of log(M*/L z ), and by a confidence interval within which 
log(M*/L z ) is constrained at a certain probability level. 

Throughout the paper we quantify the uncertainty of M*/L 
estimates in two ways: i) the offset between the retrieved M*/L 
value (i.e. the median of the PDF) and the true one, which 
quantifies the typical bias introduced by the method, together 
with its rms scatter; ii) the confidence interval given by half of 
the 16%-84% percentile range of the likelihood distribution for 
each mock galaxy (which is equivalent to la range for a gaus- 
sian distribution). While the rms of the offsets tells how much 
the median estimate scatters as a consequence of observational 
errors (so it is in a sense an estimate of the repeatability of the 
measure), the 68% range of the PDF gives the range of accept- 



able values of the parameter, which depends on the sensitivity 
of the data to the parameter. We note that the rms is in gen- 
eral lower or equal to the width of the PDF. In any case, neither 
quantity can be smaller than the intrinsic scatter in M*/L within 
a certain range of observables (as we discuss in Section [33T >. 

We wish to explore the dependence of log(M*/L z ) uncer- 
tainties not only on galaxy spectral type and spectral quality, 
but also on the observational constraints adopted to build the 
likelihood distribution. To this purpose, we consider several 
combinations of absorption features that represent situations in 
which different portions of the optical spectrum (hence differ- 
ent information on the underlying stellar populations) are ex- 
ploited. The absorption features analysed her e are those in- 
cluded in the Lick/IDS system and defined in Worthev et alj 
( 119941) . the 4000A-break D4000 n as defined in iBalogh et al.1 
(1999J) and the high-order Balmer lines HSa and Hja as de- 
fined in I Worthev & Ottavianil d 1997b . The seven sets of indices 
we consider are: 

1. D4 000 n and Hi5a (ca s e 1). T hese are the indices used 
by iKauffmann et al.l ((2003a). As discussed above, 
these two indices provide a diagnostic of the recent 
SFH. In particular, the location in the D4000 n -H<5 A 
plane is determined in first approximation by the av- 
erage stellar age, while it is less sensitive to metal- 
licity. They have indeed been shown to provide con- 
straints on stellar mass-to-light ratios and stellar ages 
with lcr errors within 30% for average SDSS spectra, 
whi le they are not sufficient to constrain stellar me tallic- 
ity (IKauffmann et al.ll2003at iGallazzi et al1l2005l) . The 
metallicity insensitivity of these indices starts to break 
down for old stellar ages. 

2. D4000 n , US A , Hja and H/3 (case 2). This set of indices 
is equivalent to the previous one, but it adds information 
from the other Balmer lines. Even assuming that all the 
Balmer lines have the same sensitivity to age (and in- 
sensitivity to metallicity) we would expect a (slight) im- 
provement over the use of one single line. The expected 
improvement could be more significant if we consider 
that Balmer lines are not pure age indicators. Unlike 
H/3, which shows little sensitivity to both total metallic- 
ity and abundance ratios, the higher-order Balmer lines 
H8a and Hja are more strongly affected by metallic 
lines in their bandpasses. They are thus sensitive to 
element abundance ratios especially at high metallici- 
ties, in particul ar to variations in abundance of C and 
N, respectively dKorn et alj2005l:|Prochaska et alj2007l: 
ISch iavon 2007). The sensitivity of Balmer lines to metal 
abundances is relevant here in that it adds further infor- 
mation helping in better constraining the log(M*/L z ). 

3. D4000 n , US A , H 7a , H^, [MgFe]' and [Mg ? Fe1 (case 
3). Th is set of indices has been used in IGallazzi et al.l 
(2005) to derive constraints on stellar metallicity, light- 
weighted ages and stellar masses of SDSS DR2 galax- 
ies. The indices [MgFe]' and [Mg2Fe] are crucial to get 
meaningful constraints on stellar metalli city. They are 
insensitive to element ab undance ratios (Tho mas et al.l 
l2003tlCoelho et al.l2 007) and are thus appropriate when 
comparing observed spectra of (potentially a-enhanced) 
ellipticals to scaled-solar population synthesis models. 
We explore here the impact of adding metallicity con- 
straints on the quality of log(M*/L z ) constraints. 
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FIG. 1 . — Left: Distribution of model galaxies in the D4000 n -H<5 / i plane. Squares indicate the location of the analysed galaxies. Galaxies populating different 
positions in the plane are characterized by different star formation histories: continuous exponential star formation with age decreasing with decreasing D4000 n 
(circle: old; square: intermediate; diamond: young) as opposed to a bursty SFH (triangle: intermediate age; asterisk: young). Right: Stellar mass-to-light ratio 
(log(M* /L z )), light-weighted age (log(f,-/yr)) and stellar metallicity (log(Z„/ZQ)) distribution of the analysed galaxies divided into the five different SFH types. 



4. D4000 n , US a, H 7A , H/3, Mg 2 , Mgb, Fe4531, Fe5015, 
Fe5270 and Fe5335 (case 4). This set of indices is 
equivalent to the previous one, but the individual ab- 
sorption features that define the composite [MgFe]' and 
[Mg2Fe] indices are used. Unlike the composite indices, 
the individual Mg and Fe absorption lines are sensitive 
to a/Fe abundance ratio variations. They need to be 
compared to models that treat variations in individual 
element abundances, when analysing real galaxies. In 
this case they might help narrowing further log(M*/L z ) 
uncertainties because of the additional a/Fe informa- 
tion. This is not the case in this work where only scaled- 
solar spectra are considered. This alternative set is used 
here only to explore the effect of additional observa- 
tional constraints (but not additional astrophysical in- 
formation). 

5. D4000„, HS A , Ca4227, G4300 and Ca4455 (case 5). 
This set combines age-sensitive indices and metal- 
sensitive indices, as in the previous two cases, but ex- 
ploits only the bluest part of the optical spectrum. It 
may thus be of interest for higher-redshift studies in 
which the red absorption features are difficult to mea- 
sure because of telluric absorption and OH skyglow. 
Ca4227 is sensitive to C and N as well as Ca abun- 
dance. G4300 is particularly sensitive to C and O and 
only little to Fe abundance. The only Fe-sensitive index 
in this group is Ca4455 which also shows an opposite 
dependence on Cr and, despite its name , no sensitivity 
to Ca abundance (iTripicco & Belllll995l) . Both G4300 
and Ca4455 are almost independent of [a/Fe] and their 
strengths in observed galaxy spectra are reasonably well 
fitted by the scaled-sola r BC03 models (see fig. 18 of 
IBruzual & Charlotll2003l) . 

6. H/3, Mg 2 , Mgb, Fe5270, Fe5335, Fe5406, Fe5709 and 
Fe5789 (case 6). As opposed to the previous case, with 



this index set we wish to explore our ability of constrain- 
ing log(M*/L z ) by using only the red portion of the op- 
tical spectrum, in particular excluding the 4000A-break. 
Only H/3 is available as age-sensitive index. In this case, 
log(M*/L z ) is determined primarily by narrowing the 
metallicity range of the stellar populations. A series of 
elements, including C and Fe in addition to Mg, con- 
tribute to the strength of both Mg2 and Mgb. They re- 
spond to [a/Fe] variations, but are still reasonably well 
reproduced by scaled-sol ar models, as opposed to Mgi 
(IBruzual & Charlotl2003l) . Fe5270, Fe5335 and Fe5406 
are all similar indicators of Fe abundance. Fe5709 
shows a weaker sensitivity to Fe, counter-balanced by 
its sensitivity to Ca. It is hence little affected by a- 
enhancement, which explains why it is well reproduced 
by BC03 models. This is also the case of Fe5789 whose 
strengt h is mostly contributed by Cr and only marg inally 
bv Fe. dTripicco & Belli 9951: iThomas etaUl2003l) . 

7. H/3, Mg 2 , Fe5709 and Fe5789 (case 7). This set is 
equivalent to the previous one but keeps only a limited 
number of indices, chosen among those that have the 
weakest sensiti vity to [a /Fe] and are best reproduced by 
BC03 models dThomas et alj l2003t IBruzual & Charlotl 
120031) . 

3. RESULTS 

3.1. Dependence on observational constraints 

In this section we explore how the uncertainties on the esti- 
mated log(M„/L z ) depend on both the spectral quality, as ex- 
pressed by the mean S/N per pixel, and the absorption features 
used to constrain the likelihood distribution. 

We first checked that for the sample as a whole the median 
offset between the estimated and the true M*/L is zero (we 
will discuss it for individual spectral classes in the following 
sections). We now turn our attention to the median lcr error 
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on log(M*/L z ) (defined as half of the 16-84 interpercentile 
range of each PDF), which is shown in Fig. [2] for all the 553 
galaxies analysed as a function of S/N. The mass-to-light ra- 
tio estimates are obtained with only D4000 n and H6a (case 1). 
The error bars indicate the scatter in log(M*/L z ) error at given 
S/N. From the plot it is clear that the mean S/N of the spectrum 
strongly influences the accuracy of spectroscopic log(M*/L z ) 
estimates and a S/N > 30 is required to constrain log(M»/L z ) 
within better than 30% when only two spectral indices are used. 
However, at higher S/N an improvement in S/N of a factor of 
2 produces only a mild improvement in the log(M»/L z ) uncer- 
tainties. There seems to be a limiting accuracy of ~ 0.07 dex 
even at S/N of 200. 

In Fig.[3]we explore whether the uncertainty on log(M*/L z ) 
estimates can be lowered by using additional spectroscopic in- 
formation. Each panel shows the la error as a function of S/N 
for each index set discussed in Section 12.21 in comparison to 
case 1 (dashed line). Fig. [3^ shows that log(M*/L z ) uncer- 
tainties can be narrowed already by simply using all the opti- 
cal Balmer lines (instead of only one) together with D4000 n : 
the typical statistical error can go below 0.1 dex already at 
S/N ~ 30 and reaches 0.05 dex at the ideal S/N of 200. 

In Fig. [3}} we compare the uncertainties on log(M*/L z ) es- 
timates derived by adding two composite indices that are sen- 
sitive to total metallicity (case 3) with those derived in the de- 
fault case 1. The improvement is significant in particular at 
S/N > 30 and the la error can even go below the 0.05 dex level 
at S/N > 100. This is because, at given age, the stellar mass- 
to-light ratio still depends on stellar metallicity: constraining 
the range in metallicity of the stellar populations can thus help 
to narrow the allowed range in log(M*/L z ). There is no dif- 
ference if the individual Mg and Fe lines are used instead of 



0.25 



0.2 



0.15 



0.1 



0.05 



D4,H<5 




50 100 150 

mean S/N per pixel 



200 



FIG. 2. — Uncertainty on stellar mass-to-light ratio (as quantified by half 
of the interquartile width of the likelihood distribution) as a function of the 
mean S/N per pixel of the spectrum. Points and solid line show the median 
uncertainty (<J]og(M t /Lp °f a " me mock galaxies, while error bars indicate the 
interquartile range of the distribution in cr\ g(M„/L)- Mass-to-light ratio esti- 
mates are constrained by D4000 n and H5 only. 



the composite a/Fe-independent indices (Fig. [3};). If observed 
spectra are compared to scaled-solar models the composite in- 
dices should be used in order to avoid any bias introduced by 
incorrect interpretation of a-enhanced stellar populations. If 
models that include variations in element abundance ratios are 
used instead, the individual indices may give additional infor- 
mation not only on total metallicity but also on relative abun- 
dances (and hence narrow the uncertainties). 

In Fig.[3jl we explore the usefulness of other metal-sensitive 
lines in the blue part of the optical spectrum, in addition to 
D4000 n and H5a- These features are reasonably well repro- 
duced by scaled-solar models, because of their weak depen- 
dence on [a/Fe] (except Ca4227). They have a negligible im- 
pact on the quality of log(M*/L z ) estimates and there is no im- 
provement over the use of only D4000 n and HSa, except at ex- 
tremely (and unrealistically) high S/N ratios. Interestingly, we 
found similar results (better by less than a hundredth of a dex 
at S/N> 100) by using instead Fe lines in the same region of 
the spectrum, such as Fe4531, in addition to D4000 n and H5a- 
This is due to the low resolving power of these indices and their 
broad relation with M*/L. The clear improvement seen in case 
3 (Fig. [3j>) is mainly driven by Mg lines which alone help re- 
ducing the uncertainties by almost 0.02 dex at S/N> 50. 

Finally, Fig.[3^,f explore the case in which only the red por- 
tion of the optical spectrum is available, in particular there is 
no measure of the 4000A-break. Here we use only H/3 as 
age-sensitive index and some Mg and Fe lines to constrain the 
metallicity range. Surprisingly these absorption features pro- 
vide log(M»/L z ) estimates with lower uncertainties than the 
case in which the reference age diagnostics (D4000 n and HSa) 
are used (but not better than the results obtained combining 
D4000 n and H5a with Mg and Fe lines on a longer wavelength 
baselines, such as case 3 or 4). The improvement, though, be- 
comes significant only at S/N > 50 and is comparable to case 
2 (i.e. all Balmer lines in addition to D4000 n , panel a). We note 
that, while metallicity constraints help to reduce log(M*/L z ) 
uncertainties, it is crucial to first determine stellar age (i.e. it is 
crucial to use at least an age-sensitive feature). The comparison 
between panels e and f shows that only three metal-sensitive 
indices, in addition to H/3, are sufficient to obtain uncertainties 
smaller than ~ 20% at S/N > 30 and the inclusion of other Mg 
and Fe lines provides only redundant information. We note that 
we have chosen Fe5709 and Fe5789 because of their weak de- 
pendence on a-enhancement that makes them appropriate for 
use with scaled-solar population synthesis models. If models 
with variable element abundance ratios are used other Fe lines, 
blueward of 5500A, could be used (i.e., the required wavelength 
range is narrower). Interestingly H/3 seems to perform as well 
as and even better than D4000 n in constraining log(M„/L z ). We 
checked this also by substituting in the index set of case 7 H/3 
by D4000 n . Indeed the relation between log(M*/L z ) and H/3 
in the library is much narrower than the one with D4000 n . The 
latter is particularly broad at intermediate D4000 n values. 

Overall the log(M*/L z ) uncertainty has a similar dependence 
on S/N regardless of the absorption features used. However, 
at a given S/N (above at least S/N ~ 10) the statistical error 
on log(M*/L z ) estimates can be significantly improved if con- 
straints from metal-sensitive lines on a relatively long wave- 
length baseline are included. If only features blueward of 
~ 5000A are available there is no significant improvement over 
the use of D4000,, and HSa only by adding other absorption fea- 
tures. 
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FIG. 3. — Same as Fig.|2]but using different sets of absorption indices as constraints, as indicated in each panel. The dashed line reproduces the curve of Fig.f2]for 
the case in which only D4000 n and H8^ are used. 



3.2. Dependence on SFH 

In this section we wish to explore how the uncertainties on 
the estimated log(M*/L z ) depend on the galaxy spectral type, 
or star formation history. In practise we wish to identify for 
different galaxy types the observational conditions (i.e., spec- 
tral S/N and observational constraints) under which a minimum 
accuracy can be reached. 

Before assessing the reliability of certain observational con- 
straints in estimating log(M*/L z ) we need to check how well 
the true log(M*/L z ) is retrieved. Fig. [4] shows, as a func- 
tion of S/N, the difference between the estimated and the true 
log(M*/L z ) (A] og ( M>(/ / L )) for three combinations of absorption 
features and for the different spectral types defined in Sec- 
tion 12.11 panels a, b and c represent galaxies along the main 
D4000 n -H<5 1 4 sequence characterized by a continuous SFH with 
increasing light-weighted age ('continuous, young', 'continu- 
ous, intermediate', 'continuous, old'); panels d and e represent 
instead galaxies with a bursty SFH ('bursty, young', 'bursty, 
intermediate'). In each panel we consider only three sets of 
indices that we deem most representative among those dis- 
cussed in Section [2721 We consider case 1 (D4000 n and H6a, 
solid black line) and case 3 (D4000 n , HS A , Hja, H/3, [MgFe]', 
[Mg2Fe] ; dashed dark grey line) because they have already been 
used in the literature to derive stellar masses of large samples 
of galaxies and they represent two distinct situations: case 1 
uses a minimum spectral information, while case 3 seems to 
provide on average the best accuracies on log(M*/L z ) (accord- 
ing to Fig. but requires more spectroscopic information. In 
addition, we consider also case 7 (dotted light grey line, us- 
ing only the red portion of the spectrum; H/3, Mg2, Fe5709 
and Fe5789): while avoiding the 4000A-break region, which 
is most commonly used to define stellar age, it seems to bring 
some improvement over the default case. 

In general, all the three index combinations considered per- 
form well in retrieving the true log(M*/L z ) value, at least at 



S/N > 30. At lower S/N there can be small offsets due to the 
fact that models 'further away' from the analysed galaxy have a 
significant weight in the PDF. The index set that performs best 
for all galaxy types and S/N is the one combining D4000 n and 
Balmer lines with metal-sensitive indices. Estimates based on 
absorption features redward of the 4000A-break are more eas- 
ily subject to biases of the order of 0.02 dex for bursty SFHs 
and young stellar populations. 

Fig. [5] provides a further visual representation of how well 
the index sets considered in this work can retrieve the true 
M*/L. We consider the estimates obtained at a S/N of 30 and 
we distinguish the different spectral types. As already shown 
in Fig. 2] there is on average no significant systematic bias in 
the estimated log(M*/L z ) values. However in many cases the 
offsets are correlated with the log(M*/L z ) value itself. This 
is the combined effect of the prior distribution of the models 
and the procedure of marginalizing over all models. Consider 
the log(M*/L z ) distribution of models with absorption features 
within la of the observed ones (which are those that mostly 
contribute to the PDF). If the analysed galaxy has a log(M*/L z ) 
value at the edges of this distribution, its log(M»/L z ) will be 
over- or under-estimated by an amount that depends on how 
narrow and peaked is the distribution in log(M*/L z ) of the 
models. The overall scatter and correlation between offset and 
log(M*/L z ) would be reduced at higher S/N. 

We now analyse the statistical uncertainty on M*/L as ex- 
pressed by half of the 68% confidence range of the PDF (we 
note that this is in general larger than the rms scatter of the off- 
sets shown in Fig. [4] - see discussion in Section 12. 2\ . Fig. [6] 
shows the median crjgg^ /L ) as a f unct i° n °f S/N for the 
five spectral types as in Fig. [4] It appears that the stellar 
M/L is generally more easily and better constrained for galax- 
ies dominated by old stellar populations and with smooth SFH 
(Fig. |6h,b,c). This means that fewer observational constraints 
and poorer spectral quality are sufficient to reach a given ac- 
curacy on log(M*/L z ) estimates. In particular, for old stellar 
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populations log(M*/L z ) can be constrained within ±0.05 dex 
using only D4000 n and H6 A and with a S/N of 20. Higher S/N 
or further observational constraints do not improve the uncer- 
tainties, except at very high S/N by adding metal-sensitive in- 
dices (panel c). 

Focusing on galaxies with continuous SFHs, statistical un- 
certainties on log(M*/L z ) estimated with D4000 n and H8a be- 
come larger, at a given S/N, with decreasing stellar population 
age. The uncertainties depend much less on stellar age in case 
3, i.e. when the composite metal-sensitive indices are included. 
This is because the impact on &\ g(fy[ n ) of adding metal- 

licity constraints is stronger for younger galaxies, which have 
a much larger spread in stellar metallicity (see Fig. [1). Metal 
lines are crucial in order to reach accuracies below 0. 1 dex for 
galaxies with young stellar populations at reasonable S/N ra- 
tios, and even below 0.05 dex at S/N > 100. It is also inter- 
esting to notice that, while D4000 n is crucial (and enough) to 
derive good constraints on the log(M*/L z ) of old galaxies, for 
younger stellar populations excluding D4000 n and using only 
absorption features redward of ~ 5000A can even improve 
the uncertainty at a given S/N. As mentioned before, except at 
D4000 n > 2, log(M*/L z ) correlates more tightly with H/3 than 
with D4000 n . 

We now turn our attention to galaxies that have experienced 
a recent burst of star formation, with young (< 1 Gyr) or 
intermediate-age (2-3 Gyr) stellar populations (panel d and e 
of Fig.|6j respectively). It is immediately clear that more obser- 
vational constraints and better spectral quality are required to 
reach a certain level of accuracy in log(M„/L z ) estimates with 
respect to galaxies with continuous SFHs. For 'bursty young' 
SFHs the only way to lower the statistical uncertainties below 
the 0. 1 dex level is to combine age and metallicity constraints 
(dashed line) and to acquire spectra with S/N above 40. 

The situation is even more dramatic for galaxies in which the 
young burst is not dominant but mixed to an older stellar popu- 
lation (panel e). The log(M*/L z ) of this type of galaxies cannot 
be constrained better than ±0.15-0.2 dex. Again, a combina- 
tion of H(3 and Mg and Fe lines performs better than including 
D4000 n (compare dashed and dotted lines). This class of galax- 
ies occupy the region where the relation between log(M*/L z ) 
and D4000 n is broadest. 

It is interesting to notice that, while in general higher mass- 
to-light ratio values are better determined than lower ones, 
we have checked that, at a given S/N, the uncertainty on 
log(M»/L z ) is primarily a function of galaxy spectral type and 
not of log(M»/L z ) itself. At young ages and continuous SFHs 
there is a further correlation with log(M*/L z ) in the sense that 
lower log(M*/L z ) galaxies have smaller uncertainties. 

3.3. Discreteness effects 

At this point, we wish to spend few words to discuss the reli- 
ability of error determinations at very high S/N. As can be seen 
from Fig. [6] the scatter in <7i og (M„/L) at given S/N is quite small 
for continuous SFHs, but becomes significantly large, and even 
larger than any trend of error with S/N, for bursty SFHs espe- 
cially at intermediate ages. Galaxies of similar type and with 
similar spectral quality can have very different errors that may 
even run into the limit of zero uncertainty. Such tiny uncertain- 
ties are not robust and are an indication that the sampling of 
the likelihood distribution is not reliable. This is also suggested 
by rms scatter of the median estimates (with respect to the true 
value) that remains significantly large even at high S/N. 



All the models in the library contribute to the PDF, but the 
largest weights are given to those models that lie within 1 or 2 
a of the observational data (in this case, the absorption indices 
of the analysed galaxy). It is clear then that by increasing the 
S/N (hence decreasing the observational errors) and increasing 
the number of observational constraints (the absorption features 
used to compare models and observations) fewer models enter 
the 'observational cell' of the analysed galaxy (i.e. have index 
values within la of the observed ones). The PDF is no longer 
uniformly sampled and the marginalization procedure becomes 
effectively a maximum-likelihood analysis. The point at which 
this happens depends on the number of observational data used 
but also on the galaxy type, or more precisely on the location of 
galaxies in the index-index space. This is because of the non- 
uniform distribution of the models in the D4000 n -H<5 J 4 plane: 
while discreteness effects are not a concern for old stellar pop- 
ulations and, in general, continuous SFHs (at least with such 
a large model library as the one used here) even at very high 
S/N, they severely affect our ability of building reliable PDFs 
of galaxies with bursty SFHs at S/N > 50 with more than two 
observational constraints. 

Increasing the number of models (e.g. by an order of magni- 
tude) will not improve the situation for galaxies with high HSa 
values. The statistical error on log(M*/L z ) cannot be lower 
than its intrinsic scatter at each position in the observational 
space. While old stellar populations have very uniform mass- 
to-light ratios, galaxies with bursty SFH have broader distribu- 
tions in log(M»/L z ). The scatter in log(M*/L z ) remains rela- 
tively large even reducing the dimension of the 'observational 
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FIG. 5. — Comparison between log(M*/L z ) estimates using different sets 
of absorption features and the true log(M» /L z ), against the true log(M» /L z ). 
We do not consider here the combination of indices D4000 n , H6 A , H7/1 , H/3, 
Mg 2 , Mgb, Fe4531, Fe5015, Fe5270, Fe5335, and the combination H/3, Mg 2 , 
Mgb, Fe5270, Fe5335, Fe5406, Fe5709, Fe5789 because they are essentially 
identical to case 3 (panel c) and case 7 (panel e), respectively. Different galaxy 
spectral types are distinguished using the same color coding as in Fig. [T] (red 
circles: 'continuous, old', orange squares: 'continuous, intermediate age', blue 
diamonds: 'continuous, young', green triangles: 'bursty, intermediate age', 
cyan asterisks: 'bursty, young'). 
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FIG. 4. — Comparison between the estimated log(M»/L z ) and the true log(M* /L z ) of the model galaxy (Ai og(Mt / L )) as a function of S/N and spectral type (as 
indicated in each panel). The curves with filled circles trace the median Ai og(Mt / L ) while the lower and upper curves represent the rms scatter of the distribution in 
AwM t /l) at given S/N. As in Fig. [6] three sets of spectral indices are shown (solid: D4000„ and H8a', dashed: metal-sensitive indices added; dotted: only indices 
in the red part of the spectrum). 
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FIG. 6. — Median uncertainties in log(M*/L z ) as a function of S/N for galaxies populating different regions of the D4000„-H<5 A plane and characterised by 
different SFHs, as labeled in each panel. Different lines styles compare the results obtained by using different sets of absorption indices, specifically using only 
D4000n and H5 A (solid), adding metal-sensitive indices (dashed) or using only the red part of the spectrum, i.e. excluding the 4000-break information (dotted). 
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cell' (i.e. reducing the index errors) and/or increasing the num- 
ber of constraints. This is illustrated in Fig. [7] where we plot 
the rms scatter in log(M st /L z ) along the D4000 n -H<5A plane for 
increasing S/N. We calculate the scatter in bins of D4000 n and 
HSa of width given by the typical index errors at a given S/N 
(left-hand panels). The values read from this plot are in good 
agreement with the errors on log(M*/L z ) shown in Fig. [6] It 
is clear that the region of highest intrinsic log(M*/L z ) scatter 
(between 0. 1 and 0.2 dex) is at intermediate D4000 n and high 
HSa values (except at very low D4000 n values which we do not 
consider in this work and are not covered by SDSS galaxies), at 
least up to a S/N = 100 where there are enough models (> 5) 
to measure the scatter. 

We also calculate the log(M*/L z ) rms scatter in bins of 
D4000 n , H5 A , Hy A , Hf3, [MgFe]' and [Mg 2 Fe], and plot the 
average scatter in bins of D4000 n and HSa (right-hand panels). 
Although the scatter is generally lower than in the left-hand 
panels (because of more observational constraints), the varia- 
tion in log(M„/L z ) is still quite significant for bursty SFHs. 

We thus conclude that the M*/L of intermediate-age stellar 
populations cannot be determined better than ±0.15 dex and 
there is no gain in either increasing the S/N above 20-30 or 
using more observational constraints. This is reflected in the 
rms of the offsets shown in Fig. [4] that remains at the level of 
0.12-0.15 dex (depending on index set). 

4. log(M» /L z ) ESTIMATES FROM COLORS 

In this section we address the statistical uncertainty on 
log(M*/L z ) derived from one or two optical or optical-NIR 
colors. The method adopted is the same as the one described 
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FIG. 7. — Scatter in log(M»/L z ) as a function of position in index-index 
plane and of S/N. The left-hand panels show the D4000„-H<5,4 plane color- 
coded according to the rms scatter in log(M*/L z ) of models in the library 
in bins of D4000„ and H8 A of width given by the observational error typical 
of a given S/N. We consider here S/N of 20, 50, 100, 200 from the lower to 
the upper panel. The boxes in each panel indicate the regions in which the 
five different spectral types considered in this work have been defined. The 
right-hand panels show instead, at each D4000 n -H5 A position, the average rms 
scatter in bins of D4000„, H<5 A , H"fA, H/3, [MgFe]' and [Mg 2 Fe], for the same 
S/N ratios as in the corresponding left-hand panel. 



in Section 12.21 but using as observational constraints broad- 
band colors rather than absorption features. It is important to 
remember that the model library used here does not include 
dust attenuation. While this has little influence on absorption 
indices, observed galaxy colors are strongly affected by dust. 
The uncertainties quoted here hence represent the expected sta- 
tistical uncertainties in the case of small dust attenuation (e.g. 
elliptical galaxies) or of good knowledge of dust correction. 

4. 1 . Estimates based on one color 

We first explore the log(M*/L z ) constraints derived using a 
single broad-band color. We consider in particular the optical 
colors g — r, r-i, g—i, B-V and B-R, and the optical-NIR 
colors V -I, V -J, V -H, V -K. Fig. [8] shows the average dif- 
ference between the retrieved log(M*/L z ) and the true value 
as a function of color, observational error and galaxy spectral 
type. We consider three different values of observational error 
on the color, namely 0. 1 mag (squares), 0.05 mag (circles) and 
0.02 mag (triangles). Each panel isolate model galaxies in the 
five spectral classes defined in Section lXTI For comparison the 
dashed line in each panel traces the median offset for estimates 
based on absorption indices (case 3: D4000 n , HSa, H'Ja, H(3, 
[Mg2Fe], [MgFe]') at a representative S/N = 30 for the corre- 
sponding spectral class. 

Fig.[8]shows that it is possible to retrieve the correct mass-to- 
light ratio value within ±0.05dex on average (except for bursty, 
intermediate-age galaxies) by using optical colors, if their ob- 
servational error is below < 0.05mag. They perform similarly 
to absorption indices in this respect, although for old stellar 
populations color-based values tend to be slightly underesti- 
mated with respect to index-based values. On the contrary, 
log(M*/L z ) estimates from optical-NIR colors alone tend to 
significantly under-/over-estimate the true value of old/young 
stellar populations. In Section [3~!2l and |3~3"1 we showed that spec- 
troscopic estimates of the log(M»/L z ) of galaxies characterized 
by bursty SFH and intermediate-age populations are affected 
by large statistical uncertainties and significant variation on a 
galaxy-by-galaxy basis. Fig.|8]shows that colors systematically 
overestimate the true M*/L z of these galaxies by 0.1 -0.2dex. 
This is the case for all the colors considered here. 

Fig. [9] shows the median lcr uncertainty on log(M*/L z ) as a 
function of the color adopted. It is interesting to see that us- 
ing optical colors with reasonably small observational errors 
(< 0.05mag) it is possible to constrain M*/L z almost as well 
as using a combination of age- and metal-sensitive absorption 
features (uncertainties are only ~ 0.02dex higher), with the as- 
sumption that redshift is known and neglecting dust corrections. 
Moreover the typical uncertainty cri og ( M| is less dependent 
on the galaxy spectral class than in the case of estimates ob- 
tained with absorption indices. For bursty SFHs the statisti- 
cal uncertainty of color-based log(M*/L z ) estimates is slightly 
better and more homogeneous than those of spectroscopically- 
based estimates. However Fig. |8]just showed that color-based 
estimates are systematically over-estimated by almost 0.2 dex 
for this type of objects. 

Fig.|9]shows that, among the optical colors considered here, 
the one producing the largest uncertainties is r-i because of 
its shorter dynamic range and the higher relative uncertainties 
with respect to the other colors. We note that observationally it 
is advisable, if possible, to avoid using the r band which can be 
severely contaminated by Ha emission in star-forming galax- 
ies. On the contrary, the optical color which provides in general 
the smallest uncertainties is g — i which has the largest wave- 
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length leverage. Statistical errors on log(M*/L z ) are below 
0. 1 dex for continuous models with relatively old stellar pop- 
ulations, and only slightly worse for bursty, inte rmediate-age 
model s (~ 0. 12dex). This is the color preferred bv lZibetti et al.l 
(2009) to derive mass-to-light ratio estimates in the ; band, with 
a maximum variation in median M*/L, of ~ 0.3dex at given g— i 
color. 

From Fig. [9] it is also clear that optical-NIR colors predict 
log(M*/L z ) with lower accuracy than optical colors, in partic- 
ular for young and intermediate-age objects (and tend to be bi- 
ased as shown in Fig. [8]). The relation between log(M»/L z ) 
and optical-NIR colors is indeed very broad (see Fig. ITOb , and 
optical-NIR colors are equally sensitive to age and metallicity, 
as opposed to optical colors which have a st ronger sensitivity 
to age (e.g. lde Jong| [l996: Ch ang et alJlioO^) . The uncertain- 
ties are thus expected to be larger especially for galaxies with 
a young stellar population component, which have a broader 
metallicity distribution (see Fig. |TJ. 

The trends mentioned above are clearly understood by look- 
ing at the distribution in log(M„/L z ) as a function of color for 
the whole Monte Carlo library (grey) and the analysed mod- 
els (colored symbols) shown in Fig. [10] The log(M*/L z )- 
optical color relation is very tight and models of different spec- 
tral class form a sequence of increasing log(M*/L z ) (and age) 
with increasing color. This and the relatively narrow color 
distribution of individual spectral classes explain i) why op- 
tical colors alone can constrain log(M*/L z ) with accuracy al- 
most as good as the one reached from spectroscopic estimates, 
and ii) the small variation in o\ og (M,/L-) both as a function of 
spectral type and within individual classes. The relation be- 
comes significantly broader for optical-NIR colors (which have 
a stronger secondary dependence on metallicity) and the spec- 
troscopic selection does not correspond anymore to a color se- 
lection. This explains the larger uncertainties and the larger 
spread in <7i og (Af„/L-) f° r estimates based on optical-NIR colors. 
Figure [10] also explains the large offsets seen in Fig. [8]) in par- 
ticular for 'bursty/intermediate' galaxies. When building the 
log(M*/L z ) likelihood distribution of individual galaxies, most 
of the weight comes from all the models in the library with simi- 
lar color as the analysed one and hence the derived log(M*/L z ) 
will rather represent the median log(M*/L z ) at a given color. 
The offset between the estimated and the true log(M*/L z ) orig- 
inates from the fact that 'bursty, intermediate' models populate 
the low-(M*/L) tail of the distribution of all models at similar 
color (as shown in Fig.fTOl). The effect is stronger and extended 
to other spectral classes for optical-NIR colors which have a 
broader log(M*/L z ) distribution at fixed color. 

4.2. Comparison with Bell et al. 2003 

We have also compared our Bay esian log(M*/L z ) estimates 
with those obtained adopting the Bell et al. (2003) formulae 
(see their Table 7) for SDSS colors, accounting for the different 
IMF. We note th at, while th e re is g ood agreement for old stellar 
populations, the iBell et all d2003i) estimates tend to be higher 
than ours at blu er colors (g - i < 1). Indeed the log(M*/L z )- 
color relation o f IBell et alJ d2003[) is flatter than the one we see 
in our model li brary. This is expl icitly shown in Fig.QT| where 
we compare the lBell et al.l d2003l) relation between log(M*/L z ) 
and g — i (solid line; the dotted line extrapolates their relation 
to the whole color range of our library) and the distribution of 
our models (grey contours). The color-shaded region shows 
for clarity only models with exponential SFHs (without bursts) 
coded according to their formation time (log(tf olm /yr)). As 



discussed in more depth by de Jong & B ell (2009 , in pre p.), 
the difference in slope between our and IBell etalJd2003h re- 
lation is primarily due to the age distribution of the models: 
at bluer colors the SFH distribution is skewed towards sub- 
stantially younger ages for the set of model temp lates used 
here c ompared to the SFH distributions assumed by Bell et al. 
(120031) . The relation between log(M*/L z ) and g — i for mod- 
lOGyr is in good agreement with their relation 
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for a 12 Gyr old stellar population. At fixed younger formation 
age only the zero-point moves to bluer color, while the slope 
does not change. This produces a steeper age-averaged relation 
(dashed line). 

Finally, we note that dust also has a secondary effect. The 
red arrow in Fig. QT] gives an impression of the effect on 
g — i and z-band M*/L on young stellar populations assuming 
Ay = lmag and an extinction curve A a oc A~°' 7 . de Jong & 
Bell (2009 , in prep.) compared stellar mass-to-light ratios cal- 
culated by Gallazz i et al.l d2005l) using these stellar population 
model s (based on line indices) with those derived by Bell et al. 
d2003l) . While the intrinsic M*/L (i.e. the mass-to-light ratio of 
the stars alone) from both methods differ, the total M*/L (i.e. 
the mass-to-light ratio that one applies to the observed light to 
estimate the stellar mass, which includes the effects of dust ex- 
tinction) of both methods match well. Dust reddens and dims 
the yo ungest stellar populat ions, bringing the total M*/L val- 
ues of lGallazzi etal.1 d2005l) and IBell et all d2003l) into better 
agreement. A similar discussion of the sensitivity of the M*/L 
distribution as a function of color to the distribution in age of 
the library (formation time, intensity a nd duration of bursts ) and 
to dust has been recently presented by Zibe tti et"ail ((2009) who 
use a model library similar to ours (but with a larger fraction of 
recent bursts), explicitly including dust attenuation. 
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FIG. 10. — Distribution of stellar log(M»/L z ) as a function of color. The 
grey shaded region represents the coverage of the Monte Carlo library of SFHs. 
Colored symbols indicate the location of the analysed models with different 
D4000 n -H^ values (red circles: 'continuous, old', orange squares: 'contin- 
uous, intermediate', green triangles: 'bursty, intermediate', blue diamonds: 
'continuous, young', cyan asterisks: 'bursty, young'). 
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FIG. 8. — Median offset between the estimated log(M*/L z ) and the true value as a function of the color used as constraint. For each color, typical errors 
of 0.1 mag (circles), 0.05 mag (circles) and 0.02 mag (triangles) are considered. Each panel refers to a particular spectral type ((a): 'continuous, young', (b): 
'continuous, intermediate', (c): 'continuous, old', (d): 'bursty, young', (e): 'bursty, intermediate'). The error bars represent the rms of the distribution in A tag ( Mt( / Lz ) 
at given S/N. The dashed line represents the median offset for spectroscopic estimates at S/N= 30. 
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FIG. 9. — Median error on the log(M*/L z ) estimates derived from optical or optical-NIR colors for the five spectral types (in the same order as in Fig. [9}. The 
error bars indicate the scatter in error (84— 16 interpercentile range). The dashed line in each panel indicates the typical crj g( M> n-^ obtained from spectral absorption 
features (case 3) at S/N = 30 for the different spectral types. 
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4.3. Estimates based on two colors 

We expect that the uncertainties on photometric log(M*/L z ) 
estimates can be reduced by combining information from more 
than one color that bring additional and distinct information on 
the underlying stellar populations. However, given the system- 
atic flux uncertainties still present in the models and the nearly- 
degenerate color information, it is found that using more than 
two colors does not lower the uncertainties on the derived M* /L 
(Zib etti et al.ll2009h . Therefore we restrict the following anal- 
ysis to the uncertainties on log(M»/L z ) estimates based on a 
combination of two colors. Specifically, we consider combi- 
nations that involve the g—i color (which provides on average 
the smallest uncertainties and the most stable results) in con- 
junction with 1) u-g, which probes shorter wavelengths and 
brackets the 4000A-break, 2) i— z probing redder wavelengths, 
or 3) i—H which extends into the NIR and offers a longer wave- 
length leverage. We further consider other two color combina- 
tions similar in spirit to g—i,i—H: B — V ,V — H and V — IJ—K. 

Fig-Elshows the difference between the estimated log(M*/L z ) 
and the true one as a function of the five two-color combina- 
tions considered here for different spectral classes as in the 
previous figures. Fig. [T3l shows the median cr log ( Mi) , / L j and the 
associated scatter. We reproduce for comparison the equivalent 
quantities for spectroscopic (case 3, dashed line) and one-color 
estimates (g — i, dotted line). 

From these two plots we can see that improving the pho- 
tometric quality of the data have a larger impact on the 
log(M„/L z ) uncertainty associated to optical-NIR color-based 
estimates, probably because of their lower degree of degener- 
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FIG. 11. — Relation between g — i color and M-*/L z for the model library 
used here. The contours show the distribution for the library as a whole and 
the dashed line the relation obtained from a simple linear fit. The solid line 
shows the relation given in Bell et al. ( 2003) for a 12 Gyr old stellar population 
extrapolated to the full range of color covered by our library (dotted line). 
The color-coded area represent the distribution of the models in our library 
with exponential SFH (i.e. without bursts) as a function of the time of onset 
of star formation (tf orm ). At fixed formation time the relation between color 
and M t /L z is shallower than the global rela tion (i.e. av eraged over all SFHs) 
and in agreement with the relation found by Bell et al. 1 2003). The red arrow 
illustrates the effect of dust attenuation on g-i and the z-band M„ / L assuming 
Ay = lmag and an extinction curve A^ oc A~ 7 . 



acy, with respect to optical color-based estimates. In general 
we notice that the combination of colors that performs better is 
g-i,i-H. However, assuming an uncertainty of 0.05 mag on 
all colors, there is overall no significant improvement by using 
two colors with respect to g — i only (compare circ les with dot- 
ted lin e). This is in agreement with the finding of IZibetti et alj 
(2009) that the mass-to-light ratio in /-band (or equivalently in 
z-band, as used here) has little residual dependence on optical- 
NIR colors. We stress that we do not include dust here, which 
is expected to increase the scatter in /L at fixed color, h ence 
increase the statistical uncertainty (see IZibetti et al.l l2009. for 
dust effects). 

At fixed photometric accuracy, different color combinations 
seem to perform similarly. This is not the case for galaxies with 
bursty SFHs for which optical-only colors [u-g, g-i) seem to 
provide more uncertain estimates. In the case of bursty stellar 
populations, two-color estimates, even with quite poor photo- 
metric quality, seem to be affected by statistical uncertainties 
similar to or smaller than spectroscopic estimates (with reason- 
able S/N of 30). However, they are still affected by significant 
bias. 

Finally, in Fig. [14] we compare our preferred spectroscopic 
log(M*/L z ) estimates, based on a combination of age- and 
metal-sensitive absorption indices (case 3), with photometric 
estimates from g— i alone (lower panel) and the combination g— 
i, i-H (upper panel). The plot shows that one-color log(M*/L z ) 
estimates agree well with spectroscopic ones within the respec- 
tive uncertainties. This is true for all spectral classes, except for 
intermediate-age models with a bursty SFH for which we have 
shown that Bayesian estimates based on color tend to be biased 
high. The upper panel of Fig. [14] shows that complementing 
g—i with information on a longer wavelength baseline extend- 
ing to the NIR further improves the comparison with spectro- 
scopic estimates (and with the true log(M*/L z )) by reducing 
biases, although the statistical uncertainty is not appreciably re- 
duced. Again, we note that although the statistical uncertainty 
for 'bursty, intermediate' types is lower in photometric than 
spectroscopic estimates, adding the i - H color is not enough 
to remove the bias in the log(M*/L z ) photometric estimates. 
This type of galaxies is intrinsically more difficult to constrain. 

5. EFFECTS OF THE ASSUMED SFH AND METALLICITY PRIOR 

So far we have analysed mock galaxies that are drawn from 
the same parent library with which their spectra are fitted. This 
represents the ideal case in which the prior distribution in SFHs 
and metallicities of the model library perfectly matches the re- 
ality. Despite the very broad range of SFHs covered by our li- 
brary, this situation likely does not happen when analysing real 
data. Moreover for each model SFH we assume a fixed metal- 
licity that does not evolve with time. This is certainly different 
from reality. In this section we attempt to quantify how wrong 
the estimated M*/L can be by assuming a prior that does not 
match the real distribution in SFH and metallicity. 

We first explore the effects on the estimated M*/L of a mis- 
match in SFH mix, in particular in the fraction of bursts of 
star formation. To do this we consider two additional model 
libraries: i) a library of continuous models, in which no burst 
is added on top of the exponentially declining SFR, and ii) a 
'burst-enhanced' library, in which 50% of the models are al- 
lowed to have a burst in the last 2 Gyr. We then use these li- 
braries, separately, to analyse the mock galaxies extracted from 
our default library (in which 10% of the models have a burst in 
the last 2 Gyr). 
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FIG. 12. — Median offset between the estimated log(M»/L z ) and the true 
value as a function of spectral type (in the same order as in Fig.[5J and for dif- 
ferent combinations of optical/optical-NIR colors. The typical offsets for spec- 
troscopic estimates (case 3, S/N=30) and one-color estimates (g—i, 0.05 mag 
error) are indicated with dashed and dotted lines respectively. 
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FIG. 14. — Comparison between log(M*/L z ) estimated from spectral ab- 
sorption features (namely D4000 n , H/3, H<5 A , H7A, [MgFe]', [Mg 2 Fe] with 
S/N=30) and from g — i color (lower panel) or g — i,i-H (upper panel) as- 
suming 0.05 mag error. The solid line indicates the expected 1:1 relation. 
Different colors indicate different spectral types (as in Fig. [TJ. The errorbars 
centered at the average log(M»/L z ) of each galaxy type indicate the average 
error on spectroscopic and photometric estimates. Photometric and spectro- 
scopic log(M„ /L z ) estimates agree well within the respective errors, except 
for bursty, intermediate-age galaxy types, for which g—i (as well as other col- 
ors) tend to overpredict the true log(M* /L z ) by ~ 0.15 dex (see Fig. [8). 
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FIG. 13. — Median error on the log(M„/L z ) estimates derived from two 
colors in the optical and near-IR. Symbols and lines have the same meaning as 
in Fig. [9] We additionally indicate with a dotted line the median log(M»/L z ) 
uncertainty in the case of g—i color with 0.05 mag error for each spectral class. 
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Figure [TBI shows the offset from the true log(M*/L z ) of the 
value estimated by fitting the absorption features D4000 n , HSa, 
H"fA, H/3, [MgFe]', [Mg2Fe](case 3) of the mock galaxies with 
either continuous models (left-hand plot) or with the burst- 
enhanced library (right-hand plot). For comparison we repro- 
duce from Fig. [4] the median and rms of the offsets when the 
mock galaxies are analysed with the same parent library (grey 
shaded region). A library without bursts cannot reproduce high 
US A values at fixed D4000 n , i.e. the region of the D4000 n - 
HSa plane occupied by models with recent bursts of star for- 
mation (see Fig. [TJ. As a result, the estimated log(M*/L z ) of 
'bursty' galaxies suffers of larger uncertainties and biases in 
particular for intermediate-age galaxies. Clearly, the situation 
becomes worse by increasing the spectral S/N. We note that 
in order to reproduce the observed distribution of SDSS galax- 
ies in the D4000 n -H<5A plane it is necessary to introduce recent 
bursts of star formation. As opposed to this case, a library that 
over-represents recent bursts would tend to underestimate the 
log(M*/L z ) of older galaxies dominated by continuous SFH. 
Instead, no significant difference is seen from the default ideal 
case in the median offset and rms for 'bursty' galaxies. 

We repeat the analysis in Fig.[16]using as observational con- 
straint the g-i color instead of the absorption features. The 
shaded region represents the range of offsets in the default case 
(adapted from Fig. [8]), while the solid and dashed lines refer 
to the case in which the mock galaxies are fitted with a li- 
brary without bursts (red) or with 50% of bursts (blue). As- 
suming continuous only SFHs does not introduce additional bi- 
ases in the log(M*/L z ) estimates of 'bursty' galaxies, while 
slightly improves the results for 'continuous' galaxies. On 
the other hand, an increased fraction of models with recent 
bursts helps to reduce the bias in the log(M*/L z ) estimates 
of 'bursty/intermediate' galaxies (as can be understood from 
Fig. [Toll, but tends to underestimate the log(M*/L z ) of 'contin- 
uous' galaxies by 0.05-0.1 dex. 

We now explore the effects of assuming a constant metal- 
licity along the SFH. We generate a Monte Carlo library with 
the same distribution in SFHs as our default library, but allow- 
ing for chemical evolution. The chemical evolution is mod- 
eled by a metallicity increasing from 20% of a randomly drawn 
Z value (in the range 0.2-2 x Z©) until one third of the to- 
tal stellar mass produced by the underlying exponential SFH is 
formed, up to 2 x Z when two thirds of the mass are formed. 
With this simple (and probably extreme) toy model of chemical 
evolution we do not intend to reproduce the possibly very com- 
plex chemical evolution of real galaxies, but only to allow for a 
mix in metallicities within each individual model. We note that 
this library does not reach D4000 n values higher than ~ 1.9, 
and hence does not properly represents observed SDSS galaxies 
with higher D4000 n , probably because higher (mass-weighted) 
metallicities would be required. 

We now generate from this variable-metallicity library mock 
galaxies representing different regions of the D4000 n -H(5 J 4 
plane, hence different typical SFHs, as in the default case (we 
use the same ranges as in Fig. [TJ but lower D4000 n values for 
the 'continuous/old' case). We then fit their spectral features 
(case 3) with our default library in which the metallicity is con- 
stant along the SFH. The result is shown in Fig. [17] where one 
can see that no significant offset in the estimated k^M^/Lz) is 
introduced by assuming a fixed metallicity, at least above S/N 
of 20 - 30. We note however a possible mild underestimate of 
the M*/L of old galaxies and increased uncertainties (as shown 



by the rms of Aj Q g^ /L )-* ^ or g a l ax i es w i m high HSa values. 

Finally, Fig. [T8l shows the results when the g-i color is 
used to constrain the M*/L instead of the absorption features. 
The color of single-metallicity stellar populations does not cor- 
rectly predict the M*/L of composite populations with differ- 
ent metallicities: there is a tendency to underestimate (overes- 
timate) the true log(M*/L z ) of old (young) galaxies by about 
0.05 dex more in comparison to the default case (i.e., when 
fixed metallicity is assumed both in the mock galaxies and in 
the fitting library). 

We note that the M*/L increases both with increasing age 
and with increasing metallicity of the stellar populations. If 
metallicity evolves along the SFH in such a way that old stars 
are metal-poorer than young ones (as in our toy model) the age 
and metallicity effects on M*/L may compensate at some level. 
If the age and metallicity effects on spectral indices compen- 
sate at the same level then we should not expect to introduce a 
bias by assuming constant metallicity. Indeed, we checked that 
M*/L varies along the D4000 n -H<5A plane, for instance, in the 
same way for both the constant-metallicity and the variable- 
metallicity libraries, so that certain values of absorption fea- 
tures are associated to the same average M*/L z regardless of 
chemical evolution. This is not the case for broad-band colors, 
which have a different response to metallicity and age variations 
than M^/L. The relation between M*/L z and g-i color is nar- 
rower and slightly steeper for models with evolving metallicity 
than for those with fixed metallicity. This explains qualitatively 
why the M^/L of blue (young) galaxies is overestimated, while 
the M*/L of red (old) galaxies is underestimated. 

We conclude that mismatches between the assumed and the 
true distribution in SFHs can affect M*/L estimates: while in 
order to reproduce the whole range of observed spectral fea- 
tures it is necessary to allow for recent bursts of star formation, 
over-representing them can lead to underestimate the M*/L of 
galaxies dominated by continuous past SFH. The effect is more 
significant for color-based M*/L estimates. Moreover color- 
based M*/L estimates are more susceptible of biases than spec- 
troscopic ones if constant metallicity is assumed. 

6. CONCLUSIONS 
6.1. Recap 

In this work we have explored the statistical uncertainties, 
isolating the effects of SFH and metallicity variations, affecting 
stellar mass-to-light ratio estimates derived by comparing ei- 
ther stellar absorption features or colors with predictions from 
a comprehensive library of SFHs. In particular we have ad- 
dressed the question of which spectroscopic constraints and 
which spectral accuracy (as quantified by the S/N) are neces- 
sary/helpful to reduce the uncertainties as a function of galaxy 
type. 

We made us e of recent medium-resol ution population syn- 
thesis models dBruzual & Charlol 120031) to generate spectral 
absorption indices and broad-band colors for a Monte Carlo 
library of SFHs characterized by both exponentially declin- 
ing and bursty SFHs. We randomly selected ~ 550 model 
galaxies in order to cover different positions in the D4000 n - 
HSa plane which represent different average SFHs: continuous 
SFHs following a sequence of increasing HSa (and decreasing 
stellar age) at decreasing D4000 n , and bursty SFHs identified 
by higher HSa values at fixed D4000 n . We then generated sam- 
ples of mock galaxies at different S/N by perturbing the ab- 
sorption indices of the selected models according to the typical 
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FIG . 15. — Offset between the retrieved and the true log(M» /L z ) as a function of S/N for the five different spectral classes (as indicated in each panel). log(M, /L z ) 
estimates are based on the spectral features D4000 n , the Balmer lines, [MgFe]' and [Mg2Fe]. The white line and the grey shaded region indicate the median and the 
rms, respectively, of A\ g(M t ) wnen me moc k galaxies are fitted with the same model library from which they are drawn (as in Fig.|4). In the bottom panels, 
the solid and dashed lines show the median and rms of the offsets for the case in which the mock galaxies are analysed with a library without bursts. In the top 
panels, the black lines represent the results of fitting the mock galaxies with a burst-enhanced library. 




FIG. 16. — Offset between the log(M* /L z ) estimated by fitting g — i color and the true one as a function of the error on color (decreasing from left to right). The 
shaded region represents the distribution of offsets (median and rms) in the default case as shown in Fig. [8] The solid and dashed lines show the median and rms of 
the offsets when the mock galaxies are analysed with a library without bursts (bottom panels) or with higher probability of recent bursts (top panels). 
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FIG. 17. — Offset between the retrieved and the true log(M»/L z ) as a 
function of S/N. Mock galaxies are constructed allowing metallicity evolution 
along the SFH but are fitted with our default library with constant metallic- 
ity. Estimates are based on the spectral features D4000 n , the Balmer lines, 
[MgFe]' and [Mg 2 Fe]. 
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FIG. 18. — Offset between the log(M»/L z ) retrieved using g—i color as 
constraint and the true log(M* /L z ) as a function of error on color. Mock 
galaxies are constructed allowing metallicity evolution along the SFH but are 
fitted with our default library with constant metallicity (the solid line indicates 
the median offset while the dashed lines enclose ±rms scatter). The white 
line and shaded regions refer to the case in which no chemical evolution is 
considered both in the mock galaxies and in the fitting library (as derived from 
Fig.®. 



observational error at that S/N. For each mock galaxy we es- 
timated the likelihood distribution of its z-band mass-to-light 
ratio (log(M*/L z )) by comparing different sets of absorption 
indices with those of all the models in the library. We quantify 
the accuracy on M*/L estimates by both the median offset from 
the true M*/L and the median 68% likelihood interval of M*/L 
computed from the PDF. 

We have considered different combinations of absorption 
indices by including age-sensitive indices only (D4000 n and 
Balmer lines), adding metal-sensitive indices (either extending 
to red wavelengths in order to include Mg and Fe lines, or only 
in the blue portion of the spectrum < 5000A), and using only 
the red portion of the optical spectrum (in particular excluding 
D4000 n ). 

The uncertainty on log(M„/L z ) estimates shows a similar de- 
pendence on S/N regardless of the spectral features used, de- 
creasing significantly with S/N up to a S/N of 30 and then only 
mildly. In particular, for the sample as a whole, a S/N per 
pixel of at least ~ 30 is required to constrain log(M >t /L z ) to 
within better than 30% when using age-sensitive indices only 
(D4000 n and Balmer lines). However there is no significant 
gain by increasing further the S/N: there seems to be a limiting 
uncertainty of 0.07 dex, when using D4000 n and HSa only, or 
0.05 dex when using all the three Balmer lines in addition to 
D4000 n . At given S/N the best results are obtained by com- 
bining D4000 n and Balmer lines with Mg and Fe features on a 
relatively long wavelength baseline, which constrain not only 
the age of the stellar populations but also their metallicity (and 
thus narrow the allowed log(M„/L z ) range). In this case, errors 
below 0.05 dex could be reached with S/N> 50 spectra. When 
only spectra below ~ 5000A are available, there is no signif- 
icant gain in using additional absorption features with respect 
to D4000 n and HSa only. On the other hand, when only the 
red portion of the spectrum is available (hence no measure of 
D4000 n is possible) similar results on k^lVU/Lz) are obtained 
by combining H(3 with metal-sensitive features. 

By distinguishing galaxy spectral type, we find in general 
that log(M >t /L z ) is easier to constrain (i.e. lower S/N and 
fewer observational constraints are required) for old stellar pop- 
ulations: for example, uncertainties below ±0.05dex can be 
reached at S/N~ 20 with D4000 n and HS A only. When us- 
ing D4000 n and HSa only, <J\ ot {M,/L-) depends on stellar age 
in the sense that it becomes larger for galaxies with younger 
luminosity-weighted ages. For such galaxies, in particular 
those dominated by a recent burst of star formation, which have 
a broader metallicity distribution, it is crucial to add constraints 
from metal-sensitive lines in order to reach accuracy < O.ldex 
at reasonable S/N. 

The stellar mass-to-light ratio of galaxies at intermediate 
D4000 n and high HSa values, characterized by recent bursts 
of SF and intermediate-age populations, cannot be constrained 
from spectroscopy within better than 0.15-0.2dex. The ability 
of reducing oi og ( M) /y f° r galaxies with bursty SFHs (in gen- 
eral with high HSa values) is also limited by discreteness ef- 
fects which come into play at S/N> 50 and using more than 
two observational constraints, in which cases the PDF cannot 
be reliably sampled. Even if there were enough models to build 
a robust PDF, the intrinsic scatter in log(M*/L z ) at fixed index- 
index values is between 0.1 and 0.2 dex at intermediate D4000 n 
and high HSa- We conclude that for this type of galaxies there 
is no gain in increasing the S/N above ~ 30 or using more ob- 
servational constraints. 
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Applying the same Bayesian method to derive log(M*/L z ) 
estimates from colors shows that, in the assumption that red- 
shift is known and neglecting dust effects, it is possible to 
reach an accuracy only slightly worse than with absorption in- 
dices. However a photometric accuracy of at least 0.05 mag 
is required. The smallest uncertainties are obtained with g — i 
which has the widest wavelength leverage. As opposed to op- 
tical colors, optical-NIR colors of similar photometric qual- 
ity provide larger uncertainties because they show a broader 
relation with log(M*/L z ) and they depend on both age and 
metallicity. Moreover they tend to under-/over-estimate the true 
log(M*/L z ) for old/young populations, respectively. 

We note that, while (optical) colors provide similar or smaller 
(and more homogeneous) uncertainties for bursty, intermediate- 
age galaxies, they systematically overestimate their log(M,H/L z ) 
by ~ 0.2dex. There is no significant gain in log(M*/L z ) ac- 
curacy by adding another color to g - i. Although uncertain- 
ties are slightly smaller by combining g - i with a NIR color 
(such as i—H), this is not enough to remove the bias for bursty, 
intermediate-age galaxies. 

6.2. The Bottom Line 

We conclude that: 

• The mass-to-light ratio of galaxies dominated by old 
stellar populations or with smooth SFH is in general bet- 
ter constrained by spectroscopic absorption indices (in 
particular combining age- and metal-sensitive features 
for young stellar populations) both in terms of retriev- 
ing the correct value and in terms of statistical uncer- 
tainty. An accuracy below 0. 1 dex is reached already at 
S/N> 20 and it can go below 0.05 dex at higher S/N. 

• For galaxies with smooth SFH and young stellar pop- 
ulations one optical color or a combination of an op- 
tical and an optical-NIR color provides only slightly 
worse statistical uncertainties on log(M*/L z ) than spec- 
troscopic constraints, in the assumption that redshift is 
known and dust is not important, and when photometric 
errors are below ~ 0.05mag. 

• For galaxies which have experienced a recent burst of 
SF (identified by high US A values at fixed D4000J we 
are limited by either large uncertainties (~ 0.15dex) on 
spectroscopic estimates or biased estimates from colors 
(overestimated by up to 0.2 dex). 

Another important conclusion is that mismatches in the as- 
sumed SFH and metallicity distribution can have an impact on 
the estimated M*/L. It is necessary to include burst of SF in 
order to reproduce the whole range of observables. However, 
over-representing them can lead to underestimate the M^/L of 
old stellar populations by < 0.05 dex if based on absorption 
features or up to 0.1 dex if based on colors. Perhaps surpris- 
ingly, the assumption of constant metallicity does not affect the 
M*/L estimates from spectra, while it may introduce significant 
biases in color-based estimates. 

6.3. Recommendations and Practical Considerations 

We finish by highlighting a few practical issues, with some 
emphasis on responses to the particular questions posed in the 
introduction to this paper. 

One of the prime motivations of this paper was to understand 
to which extent data quality affects the determination of M*/L. 



If high-quality data allowed one to uniquely select a best-fit 
model out of a sample of more than a hundred thousand SFHs, 
then relatively little thought needs to be put into the construc- 
tion of such a model template set for M*/Ls (in particular in 
the exact distribution of different SFHs and metallicities). Un- 
fortunately, the results of this paper have argued against this 
perspective. In at least few cases, the stellar M/L values one 
gets depend sensitively on the exact mix of SFHs in the model 
template set. The first, perhaps obvious, consideration is that 
the template set should cover the full range of possible SFHs 
in order to be applicable to all galaxy spectral types. This be- 
comes critical when good quality data are available. In par- 
ticular, recent bursts of SF should be well sampled in order to 
analyse intermediate-age bursty galaxies, where there is con- 
siderable bias and scatter in estimated M*/L, even at very high 
S/N and using a wide range of spectroscopic indices to con- 
strain the model fits (summarized in, e.g., Figs. [6] and [14] On 
the other hand, the exact fraction of bursts assumed affects the 
M*/L estimates of old stellar populations (Fig.[T5land [TBI . An- 
other case is illustrated in Fig. [TT] where it is clear that the 
time at which star formation is assumed to start makes a con- 
siderable difference to (at least) color-derived stellar M/L. In all 
these cases, any mismatch between the mix of SFHs in the input 
template set and the actual Universe at the redshift of interest 
will manifest itself through biases and increased scatter in stel- 
lar M/L estimates. Our analysis also shows that, while the as- 
sumption of single-metallicity stellar populations appears not to 
affect spectroscopic M*/L estimates (as long as the range in as- 
sumed metallicities encompass the range of true mass-weighted 
metallicities), it can introduce significant biases in color-based 
estimates. These considerations underline one of the key con- 
clusions of this paper, that the mix of star formation histories 
and metallicities of the input template set should match as well 
as possible the actual distribution of star formation histories and 
metallicities in order to produce optimal stellar M/L values. 

A second motivation was to try to understand if and where 
model development acquires lower priority and urgency, at 
least from the perspective of stellar mass estimates. There are 
parts of parameter space where plausible data quality could 
yield stellar masses accurate to ~ 0.03 dex, in the absence of 
any systematic mismatches between the model and data. This 
potentially very high level of accuracy is of course currently 
out of reach, given the systematic unc ertainties inherent to 
current stellar popu lation modeling (e.g. iMaraston et alj r2006: 
IConrov et alj |2008). Accordingly, it has to be concluded that 
model development and systematic error control should remain 
a very high priority for those wishing to understand the stellar 
mass content of the Universe. 

A final motivation was to understand what kind of data is 
necessary or sufficient to produce accurate stellar M/L values. 
We cannot and do not offer a definitive answer to this question 
in this paper, for at least two reasons. Firstly, constraint of im- 
portant sources of systematic stellar population model uncer- 
tainty (e.g., importance of TP-AGB stars to the near-infrared 
light or contribution of BHB stars to the optical light of old 
stellar populations) may require additional data not considered 
here. Secondly, dust has been neglected here, and further in- 
vestigation may conclude that more data will be required to 
model its effect properly than are required simply to constrain 
the stellar populations. Notwithstanding these caveats, we find 
that, while in general spectroscopic information is superior in 
constraining a galaxy SFH and hence its M*/L, in the case of 
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relatively smooth star formation histories optical colors alone 
are enough to constrain the stellar M/L to reasonable accuracy 
(0.05-O.ldex). The addition of extra broad-band information 
does not offer large improvements. An obvious and important 
case in which the addition of spectral information can improve 
the quality of M*/L estimates is where there is highly-structured 
SFH in the last l-2Gyr, where Balmer line information can 
be critical in minimizing the bias in inferred stellar M/L value. 
Moreover, according to our analysis, spectroscopic M*/L esti- 
mates appear preferable to color-based ones in that they are less 
sensitive to mismatches between the assumed and the true SFH 
distribution and the assumption of constant metallicity in time. 
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